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The viral nature of the first "giant 
virus," Mimivirus, was realized 
in 2003, 10 y after its initial isolation 
from the water of a cooling tower in 
Bradford, UK. Soon after its genome 
was sequenced, the mining of the 
Global Ocean Sampling environmen- 
tal sequence database revealed that the 
closest relatives of Mimivirus, only 
known to infect Acanthamoeba, were 
to be found in the sea. These predicted 
marine Mimivirus relatives remained 
elusive until 2010, with the first genomic 
characterization of a virus infecting a 
heterotrophic unicellular eukaryote, the 
microflagellate grazer Cafeteria roenber- 
gensis. The genome analysis of a virus 
(PgV) infecting the common unicellular 
algae Phaeocystis globosa now shows that 
it is a bona fide member of the Mimivirus 
family (i.e., the Megaviridae), extending 
the realm of these giant viruses to abun- 
dant blooming phytoplankton species. 
Despite its smaller genome size (460 kb 
encoding 434 proteins), PgV exhibits the 
most intriguing feature of the previously 
characterized Megaviridae: an associ- 
ated virophage. However, the 19-kb viro- 
phage genome, devoid of a capsid gene, is 
packaged in the PgV particle and propa- 
gated as a "viral plasmid," the first ever 
described. The PgV genome also exhibits 
the duplication of "core genes," normally 
present as single copies and a putative 
new type of mobile element. In a DNA 
polymerase phylogeny including repre- 
sentatives of the three cellular domains, 
PgV and the other Megaviridae cluster 
into their own clade deeply branching 
between domains Archaea and Eukarya 



domains, thus exhibiting the topology of 
a fourth domain in the Tree of Life. 



"Giant viruses"' were initially defined on 
the basis of their particle (virion) size — 
more than 0.5 JJtm in diameter — (making 
them observable under a light micro- 
scope) incorporating a DNA genome 
more than a million base pairs (Mb) in 
length, encoding about 1000 proteins 
(hence the proposed name "Megaviridae" 
for their family). Most of them replicate 
in Acanthamoeba including 4 that have 
been fully sequenced^'' (Table 1). Their 
replication entirely proceed in the host 
cytoplasm through the initial building of 
a large virion factory within which their 
genome is replicated and transcribed, and 
from the periphery of which new particles 
emerge at a later stage.'' '* Long after the 
bioinformatics prediction that Mimivirus 
relatives should be abundant in the sea,'''" 
the first Megaviridae family member not 
infecting an Acanthamoeba (classified in 
the Amoebozoa, Unikonta) was identified 
in Cafeteria roenbergensis, a heterotrophic 
unicellular plankton species, belonging to 
a very distant phylum (Heterokontophyta, 
Bikonta)." Despite its smaller genome 
(about 700 kb), Cafeteria roenbergensis 
virus (CroV)'^ shared a large proportion 
of best matching orthologous proteins 
with Mimivirus, and possessed many of 
its unique features (Table 1), including its 
own type of virophage,'^ a specific type of 
mismatch repair protein (MutS7),''' and 
an amino-acyl tRNA synthetase, a type 
of enzyme normally specific of cellular 



www.landesbioscience.com 



Communicative & Integrative Biology 



e25685-1 
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organisms." The genome analysis of PgV- 
16T,"' a virus known to regulate the popu- 
lation of the bloom-forming microalgae 
Phaeocystis globosa^^ (Haptophyta, Bikonta) 
now indicates that the Megaviridae fam- 
ily includes viruses infecting dominant 
phytoplankton species. With its 459 kb- 
genome encoding 434 proteins, PgV-l6T 
is the most complex virus known to infect 
a photosynthetic organism. 

Virophages might be common 
in the viral world 

Although global gene content and detailed 
sequence similarity comparisons with the 
previously described Megaviridae mem- 
bers clearly classify PgV within the fam- 
ily (including the partially sequenced 
Organic Lake Phycodnaviruses (OLPV), 
Chrysochromulina erecina virus, and 
Phaoecystis pouchetti virus),'' the find- 
ing of a PgV-associated virophage came 
as a surprise. Eight complete virophage 
genomes have now been described, but 
only 3 correspond to identified and isolated 
"hosts" — the 2 closely related "Sputnik" 
1 and 2"*'" infecting Mimivirus, and 
"Mavirus" infecting CroV.^° The PgV 
virophage (PgW)"" is the third, and pre- 
sumably not the last, of a rapidly growing 
series. ^' "^ Except for their genome size in 
the 20 kb range, these new types of sat- 
ellites viruses have little in common in 
terms of gene content, although they all 
code for a major capsid protein and one 



DNA primase.'^'^^ Consistent with the 
fact that it was never observed in PgV- 
infected cultures, PgVV is the first exam- 
ple of a virophage lacking the information 
to make a capsid protein, the essential 
building block of a virus particle. Once 
multiplied in the PgV virus factory, the 
PgVV genome is thus packaged (in mul- 
tiple copies"') alongside the PgV genome, 
and propagated through the PgV virion, 
either as an integrated or free viral "plas- 
mid."'' The precise molecular structure (s) 
that can be adopted by the PgVV genome 
are under further studies. 

The finding of a virophage associated 
to PgV is already teaching us the impor- 
tant lesson that virophages are not solely 
associated to DNA viruses with micron- 
sized particles and 1 Mb genome sizes, but 
can be found with large DNA viruses of 
more reasonable (Poxvirus-like?) propor- 
tion. It is thus likely that many of such 
associations have been overlooked in the 
past, and that virophages might have 
played a fundamental role in the evolu- 
tion of many more viruses than just the 
Megaviridae. If this is true, they could be 
(and have been) the main vehicle of gene 
transfers between Eukaryotic viruses, and 
indirectly between these viruses and their 
hosts. They may also be responsible for the 
sporadic occurrences of mobiles elements 
such as self-splicing introns, inteins, and 
transpovirons." 

Another lesson is that, like all para- 
sites, the virophages are submitted to the 



irreversible phenomenon of reductive evo- 
lution, condemning them to disappear as 
individual biological entities, eventually 
saving some of their genes by integrating 
them into the genome of their companion 
virus, themselves undergoing a similar 
process vis-a-vis their cellular host. 

PgV clusters with the previously 

described Megaviridae in an 
apparent "fourth domain" in the 
tree of life 

The first published phylogenetic tree 
including Mimivirus 2 and using a con- 
catenation of 7 universally conserved 
protein sequences, already pointed out 
that: "we could now build a tentative 
tree of life, within which Mimivirus 
appears to define a new branch dis- 
tinct from the three other domains."^ 
Elsewhere in the same article, Mimivirus 
was also shown to be part of the broad 
family of the nucleocytoplasmic large 
DNA viruses (NCLDV), branching near 
the middle of the previously defined 
Iridovirus, Phycodnaviruses, Poxviruses, 
and Asfarviruses lineages. Put together, 
these 2 results suggested in a sublimi- 
nal way that all the large DNA viruses 
were in fact defining a domain distinct 
from the 3 established cellular domains. 
As additional genomes of Megaviridae 
became available, molecular phyloge- 
nies computed with an increasing num- 
ber of universal proteins associated to 
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Figure 1. Simplified Tree of Life including the 6 fully sequenced megaviridae. The tree was produced using the default option on the IMAFFT server 
(http://mafft.cbrc.jp) from the multiple alignment of 25 DNA polymerase B sequences (510 ungapped positions, excluding the inteins). Branches with 
bootstrap values < 80 are collapsed. Despite infecting eukaryotic hosts from vastly divergent phyla, the viruses do not show any phylogenetic affinity 
with a specific eukaryotic group, and cluster (in red) separately from the 3 cellular domains: Eukarya (green), Archaea (purple), and Eubacteria (blue). 
This strongly supported topology suggests that the common ancestor of the Megaviridaelargely predated the radiation of the Eukaryotes (CeV: 
Chrysochromulinaericina virus, unpublished). 



basic functions (DNA clamp loaders, 
Ribonucleotide reductases, Aminoacyl- 
tRNA synthetases, DNA polymerases) 
kept clustering the megaviruses in their 
own clade, clearly separate from the cel- 
lular domains, thus contributing an 
additional domain rooted in between 
Archaea and Eukarya in the Tree of 
Life.'*'^'''"' Despite receiving increas- 
ing support from various authors, 
others remain strongly opposed to this 
view.^'''" In agreement with the notion 
of a fourth domain of life anchored by 
the largest DNA viruses, the DNA poly- 
merase of PgV nicely clusters with the 
other Megaviridae homologs, exhibiting 
no affinity with any of the main cellular 
lineages (Fig. 1). Based on the presence 
of a vestigial protein translation system, 
I previously argued that the genome of 
today's giant viruses derived from an 
ancestral cellular organism through the 
irreversible process of reductive evolution 
experienced by all parasites. DNA viruses 
exhibiting a whole range of genomic 



complexity could have been generated 
through this continuous process: viruses 
would have lost translation first (the 
ribosome), then transcription (the RNA 
polymerase), then DNA replication (The 
DNA polymerase), following an evolu- 
tionary scenario whereby they become 
increasingly dependent from their host." 
The vast range of size and complexity 
among today's DNA viruses might thus 
be the results of differences in their evo- 
lution rates (the largest ones experienc- 
ing the least evolutionary pressure). In 
this context, the lack of affinity of the 
giant viruses with any of today's cellular 
domains, as well as the huge proportion 
of the viral genes without cellular homo- 
log, would suggest that the lineage of the 
ancestral cellular organism that gave rise 
to giant viruses became extinct as a cel- 
lular life form, and only managed to sur- 
vive as a parasitic "fourth domain." The 
recently described giant Pandoravirus 
might represent an independent instance 
of the same scenario." 
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